: 


201960 
PROCEEDINGS 


A d REGISTERED No. M, 3121 


INDIAN ACADEMY 
OF SCIENCES 


SECTION A [No. 1 


JULY 1960 


Price Rs. 2 or 3 Sh. Annual Subscription Rs, 18 


| 
ay 
2 


NGAL 


AT TRE BA 


1 


ELASTIC BEHAVIOUR OF MATTER UNDER 
VERY HIGH PRESSURES 


Uniform Compression 


By S. BHAGAVANTAM, F.A.Sc. AND E. V. CHELAM 
(Director’s Research Laboratory, Indian Institute of Science, Bangalore) 


Received April 21, 1960 


1. INTRODUCTION 


THE elastic behaviour of substances, which are already in a highly strained 
state, is a subject of considerable importance, in its fundamental aspects 
as well as in its application to important questions like the constitution and 
stability of massive entities such as exist in the interior of the Earth. In a 
paper entitled ‘Elasticity and Constitution of the Earth’s Interior,’ Birch 
(1952) has given a comprehensive account of this problem. Matter in the 
interior of the Earth is subjected to large stresses, resulting in an accumu- 
lation of great amounts of strain energy. It is known that periodical 
releases of the strain energy, thus accumulated, manifest themselves as earth- 
quakes in various regions, although the exact causes thereof and the detailed 
mechanism of these energy releases, are still matters open for further study. 
It is also known that shear waves are not sustainable in the interior of the 
Earth below a depth of about 3,000 km., while up to that depth from the 
surface, compressional as well as shear waves are present. An explanation 
of these remarkable phenomena would necessarily involve a knowledge of 
the elastic behaviour of matter at the depths in question. Thus, the first 
step in this direction would be the evaluation of the elastic constants of 
substances, which are already under great strains, in terms of known 
parameters. 


When considering such large strains, one has to have recourse to the 
finite or non-linear theory of elasticity, an exact development of which in 
matrix form has been given by Murnaghan (1951). When the strain com- 
ponents are large, the stress at a point is no longer a linear function of the 
strain components at that point, but includes quadratic terms as well, and 
the strain energy function ¢ has to be extended to include at least the cubic 
powers of the strain components. The coefficients of the quadratic terms 
have been called ‘the Second Order Elastic Constants’ and of the cubic 
terms, ‘the Third Order Elastic Constants’. Birch (1947, 1952), Fumi 
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(1951), Hearmon (1953) and Bhagavantam (1949, 1957) have dealt with this 
problem and have derived the independent third order constants for the 
various crystal classes. In the present work, a reference to these investiga- 
tions will become necessary as the strain energy includes second as well as 
third order elastic constants. 


In studying the elastic response of a substance which is initially subjected 
to a finite strain, additional strains can always be visualized as infinitesimal 
in character, since the case of a finite strain superposed over another finite 
strain can be reanalyzed so as to correspond to an infinitesimal strain imposed 
on a finite one. As this additional strain is infinitesimal, its effect in the 
strain energy function will be limited to quadratic terms only, the coeffii- 
cients of which will be the elastic constants appropriate to the substance 
in the prestressed state. We shall designate these as the ‘effective elastic 
constants’. They completely specify the elastic response of a substance in 
a state of finite strain, when additional infinitesimal strains are superposed 
thereon. These effective constants will depend on the second and third 
order elastic constants of a stress-free state and also upon the state of initial 
stress. 


Expressions for such effective elastic constants can be derived by noting 
that the elastic energy of the infinitesimal deformation must be equal, after 
multiplication by a suitable magnification factor and allowing for the work 
done by the external forces, to the difference in the total energy of the 
system when subjected, firstly to the finite strain, and secondly to the finite 
plus infinitesimal strain considered as a single composite state of strain. 
The magnification factor arises because the strain energy always refers to 
a unit initial volume and this volume is different according as whether we 
choose our initial state as the stress-free state or as a state of finite initial 
stress. It will be observed from the details given in the following that this 
method of evaluating the elastic constants from the use of the energy dif- 
ference, adopted in the present paper, is much simpler than the method of 
evaluating them from the Stress Matrix—a procedure adopted by some 
earlier workers (Birch, 1947; Hughes and Kelly, 1953). There is a discre- 
pancy between the values obtained by us in this paper and by Birch (1947) 
and a discussion of the same is also given. 


It has to be mentioned here that Green and Zerna (1954) have con- 
sidered the question of superposition of an infinitesimal deformation over 
a triaxial finite strain for an isotropic substance but their treatment does 
not explicitly bring in the third order elastic constants. We have explicitly 
considered these constants and studied in detail the case of cubic symmetry 
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because such a case would pertain closely to substances which actually occur 
in nature. In the present paper we have also included, in a tentative and 
general way, the indications which the energy function would give regard- 
ing the anomalous behaviour of matter subjected to high pressures. Detailed 
discussions of this aspect involve the Theory of Dynamical Elasticity, a topic 
which we propose dealing with in a subsequent paper. 


2. FrntreE DEFORMATION THEORY 


We now proceed to give the formalism of the finite deformation theory 
to the extent required for our problem. By a deformation, we mean that 
the internal constituent points in a body have undergone displacements with 
respect to each other, so that their relative distances have altered. To specify 
such an alteration in the distance between a point P and every other point Q, 
R, S, etc., in its neighbourhood, we first note that the general expression for 
an infinitesimal length appears as a quadratic (ds* = dx? + dy* + dz?). 
If therefore ds, is the length PQ before deformation and ds the same length 
after deformation, half the alteration in the square of the length, (ds? — ds,*)/2 
is taken as a measure of the state of strain of the distance PQ in question. 
By assigning general positional co-ordinates to Q, the alteration in length 
not only of PQ but of every other length PR, PS, etc., in the neighbourhood 
of P is obtained, so that the state of strain around P is completely specified 
by the quadratic form (ds* — ds9*)/2. To derive an expression for this 
quantity, we note that, if as a result of the deformation, a point with initial 
co-ordinates (a, b, c) referred to any convenient space fixed Cartesian co- 
ordinate system moves over to a final position (x, y, z) referred, for con- 
venience to the original frame, we can introduce a displacement vector with 
components (u,v,w) given by x=a+u, y=b+v,z=c+w, where 
u,v,w are functions of the initial positional co-ordinates (a,b,c). If the 
point P, with co-ordinates (a, b,c) moves over, as a result of deformation 
to P’ with co-ordinates (x, y,z)=(a+u, b+ v, c+ w), the point Q in 
the infinitesimal neighbourhood of P with co-ordinates say (a + da, b + db, 
c+ dc) will move over to the position (x + dx, y+ dy, z+ dz) where 
dx, dy, dz are given by the relations 


ou ou ou 

dx =(1+ da+ db+ 
ov ov ov 

dy = 32 -da+ (1+ 32 de 


dw ow ow 
dz = dat + (1+ dc. (1) 
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The nine derivatives du/da, etc., completely specify the most general dis- 
placements, including rotations of the element as a whole, in which the 
relative distances of points within the element are not altered. There is 
therefore no room for play of the elastic forces within the element, and one 
could then remove the effect of rotations by the removal of the antisymmetric 
part in the equations by employing the usual methods. This will leave only 
6 independent combinations of the displacement derivatives to be con- 
sidered in the elastic deformation. If we take it that the rotations are zero, 
this will immediately require that du/db = dv/da, etc., so that the matrix 
of the transformation is symmetric as it stands with six independent compo- 
nents. In the linear or infinitesimal theory of elasticity, the strain com- 
ponents are identified with these six independent displacement derivatives 
du/da, dv/db, etc. In the finite or non-linear theory of elasticity, however, 
we work out (ds? — ds,”)/2 using relation (1) and noting that ds? = dx? 
+ dy? + dz? = square of length PQ after deformation and ds)? = da? 
+ db* + dc* = square of length PQ before deformation. This quadratic 
form therefore appears as 


ds? — _ 2 2 2 
+ + + 2n:2dadb + 2negdbdc 
+ 2ns,dcda (2) 


m= 3 + (3) + 


with similar ee for nee and 733, and 
ov , Ww OW 


with similar equations for 723, 73, etc. 


where 


The »’s are thus coefficients in a certain quadratic form expressing the 
change in the squared length and they constitute the 9 entities of a (3, 3) 
matrix styled as the strain matrix. In view of the symmetry of this matrix, 
the 9 components reduce to 6 and a convenient representation of the strain 


matrix is 


16 UT 
n= Ne Ne Na (5) 
15 Na Ns 


+ 
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It is evident from (3) and (4) that when the squares and products of the dis- 
placement derivatives du/da, etc., are neglected, the strain matrix of the 
finite theory becomes identical with the infinitesimal matrix. In view of 
the respective definitions, one can also say that in the infinitesimal theory, 
it is of no consequence whether the displacement derivatives are calculated 
at the initial or final positions of a point and the initial and final positions 
are thus interchangeable. In the finite theory, however, the distinction 
between the initial and final states has to be maintained. 


The displacement of a point from an initial position (a, b, c) to the final 
position (x, y, z), as given in equation (1), is most conveniently represented 
by the transformation x = Ja, (x = x, y, z), (a = a, b, c) and J, the Jacobian 
of the transformation is given from (1) by 


dx dx dx 
da db da (db 

J= ov dv 6 
da 1+ 35 PY Ma ode (6) 
da db dC 


Denoting by J* the transpose of this matrix, it is evident that if we form 
the product matrix J*J, subtract unity from each of the 3-diagonal elements 
and halve the result, we get the matrix of coefficients of the quadratic form 
(2) and this is the strain matrix (5). Murnaghan (1951) has employed this 
form as the definition of the strain matrix in the following way. 


7 =4(*J — E). (7) 


In (7), E, is the unit matrix of dimension 3. We also note here that 
the subtraction of E, indicates that the original length is being subtracted 
from the new or altered length between two points, which means that in 
our treatment, we are excluding all such displacements which keep the relative 
distance between every two points within an element unaltered. Rotations 
are displacements of this type, and in the finite theory, the removal is effected 
from the product matrix J*J and not from J itself as in the infinitesimal 
theory. J thus need not be a symmetric matrix, nor is it rendered so, 
in the finite theory. A simple shear given by 


x=a+nb, y=b, z=Cc, 


is an example of an unsymmetrical J. However the strain matrix formed 
from the same by the formula 7 = 4(J*J — E;) is symmetric. When 
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formulating the most general displacement, we must therefore take J with 9 
independent components (and not reduce to 6 by removal of the anti- 
symmetric part), so long as finite deformations are concerned. We shall 
have occasion to refer to this question again at a subsequent stage in this 


paper. 


3. STRAIN ENERGY EXPRESSION IN THE FINITE THEORY, 
WITH AND WITHOUT AN INITIAL STRESS 


The strain energy ¢, measured per unit volume of the initial state, would 
be a function of the 6 strain components 7,...y., and can therefore be 
expanded as a power series in the 7’s. 


tost+...... (8) 


The constant ¢, has no significance for us, as we are interested only in the 
derivatives of ¢. The first order term ¢, will be a linear function of the 
n’s of the form 


= Cym + Cone + Cans + + + (9) 


In the infinitesimal theory of elasticity, the 6 components of stress T,, T., ... 
T, corresponding to the 6 independent elements in the (3, 3) symmetric stress 
matrix Tj, are obtained as the derivatives of ¢ with respect to the corres- 
ponding strain component 


Tx = a ° (10) 


Like the strain tensor, we use here the notation T,,; = T,, Tz: = T2, T33 = Ts, 
Tes = Tos = Fe Ta = Tis => Ts Tis = Ta => Te. In the finite theory, the 
stresses are given, as shown by Murnaghan, as the elements in the matrix 
derived from the formula 


T = Bye, 

Pa Gh 
Here pz is the density of a volume element after deformation while pg is its 
density before the deformation. If Vz and Vq be the corresponding volumes 
per unit mass of the substance, we have the relations 


Px _ Va (12) 
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To apply formula (11), we have to first write out the energy as a function 
of all the 9 strain components nj, i.e., disregarding the symmetry relation 
Nik = 7ki. Same procedure has to be adopted when applying (10) and 
(9) as well. From this the energy derivative matrix 24/dyi, has to be 
obtained and the product matrix of J, 3¢/3n and J* be formed. This has to 
be finally multiplied by pz/pg = (1/det.J). Such calculations are in 
general laborious. 


Whether the stress formula is regarded as based on (10) or (11), it is 
easy to see, on differentiating ¢, that the first order coefficients c,, cy... Cg 
must be equal to the initial stress components T,°, T,° ... T,° whenever 
such stresses are present. When we consider the superposition of strains 
5, -- Sng over finite strains 7,°, 72° caused by stresses T,°, T,° 
-++ JT, the general form of ¢, will be 


$1 = + + + 2T + + 2T a3 


When the initial stress is a uniform hydrostatic pressure P, which is the case 
under consideration in the present paper, 


= —P (8m + + (14) 


—P being used for the stress, since P is generally kept positive. This first 
order term in the energy will not occur whenever the energy is measured 
from an initial state of zero stress. 


The second order energy $s, being a quadratic form in the 7’s, contains 
as coefficients the usual second order elastic constants, which correspond 
with the generalized Hooke’s Law, while the third order term ¢, contains 
the third order elastic constants as already mentioned, these coming into 
play when the strains are large. Following Birch (1947) and Hearmon 
(1953), we can write the full energy expression for a substance of cubic 
symmetry, when the strains are all measured from the state of zero strain, 
i.e., a stress-free state, as 


= + $3 
+ + 3”) + Cie + + 


+ (m4? + 287 + 167) + Cin no® + 73°) 

+ (nr + 72) + 2293 (M2 + 13) + 19% (ns + 

+ Cresmiens + + (nina? + + 

+ Cass {71 + 06°) + 72 (04? + 167) + + (15 
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This contains 3 second order constants and 6 third order constants. 
The general method followed in writing the third power terms is to expand 
$3 and write it as 2 Cpgrnpnqnr, where p, q, r take any values | to 6 subject 
to the condition p <q <r. 


When we consider the casé of superposition of an infinitesimal deforma- 
tion Sy over an initial state of finite strain » corresponding to an initial 
hydrostatic pressure P, the energy for the 5 deformation should be written, 
to be consistent with the above formulation, as 


= 8" (Bn) = — P (Bm + + + + + + 


bye (8m + + + + + (16) 


The higher powers are neglected as the deformation 5y is treated as infinite- 
simal. The 5» are measured from the y state and the energy is per unit 
volume of this 7 state. 


4. EVALUATION OF THE STRAIN ENERGY FOR MATERIAL UNDER 
HYDROSTATIC PRESSURE 


We now proceed to evaluate the Strain Energy function defined in (16) 
in terms of the constants c,,, Cj, etc., of the stress-free state, defined in 
(15). For this purpose, we first carry out the finite hydrostatic deformation 
of the medium by which a point (a, b, c) is moved over to a point (Xo, yo, 
Z) given by x»=(1+ yo=(1+ z%=(1+7)c¢ where 


giving the magnitude of the compression as a change of length per unit 
initial length. The corresponding Jacobian is 


0 0 
0 0 1+7 


From the Murnaghan formule for strain and stress (7 and 11), we get 
the strain and stress matrices respectively as 
7 + 7/2 0 0 
0 n + 7/2 0 (18) 
0 0 + 3/2 


( 
t 
t 
J 


_ dw_ 
7"  & 
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ag 
| 0 0 \ 
19 
7 
\ 


Substitution of the strain matrix values (18) in the energy expression 
(15) gives us the energy due to the hydrostatic pressure alone. To obtain 
the energy for the initial strain plus the infinitesimal strain, we formulate 
the corresponding J. If as a result of this additional strain the point (Xo, 
Yo: Zo) Moves over to (x, y, z), the most general relationship between them 
will be given by 

x = (1 + 81) Xo + + 5520 
= + (1 + 82) Yo + 


Z = 85xXq + S4¥o + (1 + 85) Zo. (20) 
The transformation x,—>x being thus given by the Jacobian 
5s 
35 


In formulating Js in this form with 6 components, we have assumed that 
the rotations are removed beforehand. Special cases such as that of a 
simple shear, superposed over a finite strain, would require the considera- 
tion of an unsymmetrical J. We will not, however, consider such a case 
here for the sake of simplicity. 
Now substituting the values of x» = (1 + »)a, etc., we finally get the 
transformation from the initial position a, b, c to the final position x, y, z as 
x= Ja 
where 
J=(1 +) Jp. (22) 
Forming 4 (J*J — E;), we obtain the strain matrix for this entire deformation 
7 + Sn, as 
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where R = » + »?/2 = Initial strain, g = 1+ 7 
5m, = 5, + + 55? + 56°) Sg = 84 + (5254 + 8354 + 
Sng = + $(5q? + 542 + 56?) = + + 8555 + 545g) 
Sys = S53 + $(53? + 542 + 55”) 9g = Sg + $ (515g + + 5455) 
(24) 


It should be noted here that we are maintaining the distinction between 
the 5 matrix (21) which specifies the displacement, and the 5y matrix which 
is the strain matrix corresponding to the J; transformation. Further, while 
we have taken our additional deformation as infinitesimal, we are still 
keeping the square terms 4,?, etc., in the expression for 5y. This is neces- 
sary because -the energy of the infinitesimal deformation ¢’ will ultimately 
have terms up to squares and products of the displacement derivatives 5,, 
8, etc. If ¢’ contained only quadratic terms in the 8n, it would have been 
wholly unnecessary to keep any square of the 4 in the expression for 5». 
However, when an initial stress is present, ¢’ contains linear terms in the 
5» which give rise to terms of the second degree in the 8’s as is evident from 
(24). Hence we note the important point that, when an initial stress is 
present, even an infinitesimal strain thereon will have to be treated as a 
finite one only, no approximation being justifiable in the value for the strain 
element. After the energy expression is formed using the correct value 
of the strain component, terms higher than the second powers of the dis- 
placement derivatives could be neglected, to justify its classification as an 
infinitesimal deformation. 


We can substitute the values of the strains from (23) to get the total 
energy. An equivalent, but somewhat simpler method would be to note 
that an increase of 5» from the ‘y’ state gives rise to an increase of 4n = 
(1 + 7)? 5» in all the elements of the strain matrix. Here Ay is the increase 
in strain when the strain is measured with reference to the stress-free state, 
while 5y is measured from the state of finite strain 7. As the energy expres- 
sion (15) refers to the initial stress-free conditions, any increase of the 
energy due to an increment of 4y in the strain, can be most simply got 
from the Taylor formula, 


25) 


neglecting higher powers of the 47. This is the energy of the infinitesimal 
deformation when referred to a unit volume of the stress-free state. We 


m 
eC 
| 
| 
1 


— 
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multiply it by 1/DetJ, to refer it to a unit volume of the ‘y’ state [see 
equation (12) in this connection], i.e., by a factor (1 + y)*. According to 
the energy method explained earlier, we equate this quantity to ¢’ to get 


(1 + + + + + Cizs)} 
(1 + 9)? (8m + + Sys) 
+ $ {ey + R + 4Cr12)} (1 + 9)* (81? + + 
+ {er +R.Cyog-+R .4C p32} (1 82573 + 


= — + + + 28 (8m? + + 873") 


+> Dis (5,572 + 5253 53571) 
+ (Sng? + + S76”). (26) 
Equating the coefficient of 5y, on either side, we immediately get 


2/2 2/2)2 
=P = (en + 20) 4 + + 


which, on development in powers of 7 and retaining up to 7?, becomes 
— P= + 2¢12) + 1? (3Cn + + Ciss — cu). (27) 


In the same manner by equating coefficients of 5,°, 5n,5y, and 5n,? we get 
by = Cu + + 6Cin + + 0? + (28) 


= Cig + 9 (Cre + + Cres) + 7? (6Cuz +3 ) (29) 
¢ 
Ca +7 (cas + + Cuss) + 7? G Ciaa + 5 Ciss) 


5. EFFECTIVE ELASTIC ENERGY AND EFFECTIVE ELASTIC CONSTANTS 


So far our task has been to derive an expression for the additional strain 
energy due to an infinitesimal deformation in powers of the strain components 
8», each of which however involved not only terms linear in the 5, 53, etc., 
but quadratic terms in them as well. For further discussion on the elastic 
behaviour of the substance from the ‘yn’ state, it will be necessary to express 
this energy in terms of 5, --- 5g. Substituting from (24) in (16) and retain- 
ing terms up to the second power of 8’s only, we get 


) 

n 

h 
le 

ll 

y 

1s 

n 

n 

a 

n 

n 
e 

t 
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P (6, + + 53) + P) + 5,” + 537) 
+ (815, + 5253 + 535.) + (2b 44 — P) (Se? + + 5,2). 
(31) 


The expression ¢’ refers to the total deformation energy which should 
be rendered available from external sources in order that the deformation 
n to » + 8» could be effected on a unit volume element in the 7 state. It 
has to be remembered however that an external force, namely the hydrostatic 
stress, is already present and there is the potential energy associated with 
this force. When displacements resulting in deformations 5,, 5, 5, take 
place, an amount of energy 4W = — P(, + 5, + 45) is released from this 
external potential energy. In other words the work done by the existing 
external force in this displacement which is 4W can go in, as increase in 
internal energy, i.e., the strain energy, and thus meet the part of the total 
requirement ¢’. Hence the extra energy, that should be made available in 
order that the deformation » to » + 5y may take place, is given by ¢e = ¢’ 
— AW. de, which is the effective elastic energy, represents the energy asso- 
ciated with an infinitesimal deformation which takes place from a state of 
finite strain, just as the usual elastic energy ¢ refers to a similar deformation 
of a state with zero initial strain. ¢¢ is thus obtained from ¢’ by the removal 

of the linear terms —P (5, + 5, + 53). We have 


de = ¢' —4W=¢' + P(8, + + 85) = (81? + + 85%) 


12 (5,5, 5255 535,) + 2c’ (5,? 55” 54”) (32) 
where 


C1 = by — P= Cy + 9 (2 + + + 


+7 + 12Cy2 + — 3 as) 


= Dye = Cre + (Cre + + + 7? (5Cus + 5 Cun) 
P 
= Dag — 5 + + +S + Cass) 
3 
+7? Cin + + + 


+5 Cus — (33) 


Th 
WI 
as 
an 
des 
the 


ea & 


wi 
Wi 
lir 
sti 
al 
m 
fc 
m 
re 
st 
sl 
st 
T 


32) 


3) 


Elastic Behaviour of Matter Under Very High Pressures 13 


The new constants c’,;, C’y2, C’ 44 Will be called the effective elastic constants. 
While ¢¢ could have been directly derived by substitution of relations such 
as 5n, = 5, + 4(8,? + 5, + 5,?), in the function ¢, the development of 
an intermediate function ¢’ as has been done in this paper helps us to retain 
$y as it is, till a convenient stage is reached, whereafter ¢e could be easily 
deduced. This reduces the labour involved in a full substitution of 5y from 
the beginning itself. 3 


The 5,, 5, ... 5g are the displacement derivatives. In the infinitesimal 
theory, they are directly identified with the infinitesimal strain components 
themselves. We can therefore get the stresses required for effecting the 
§ deformation by the usual processes of differentiating the appropriate 
strain energy function, which in this case is ¢?e. We thus obtain for the 
additional strains t,, t..-. tg the relations 


ty = + (8, + (34) 
with similar equations for ¢, and fs and 
ty = (35) 


with similar equations for ¢, and ft. (34) and (35) are indicative of the 
linear relationships which connect the additional stresses with the additional 
strains through the effective elastic constants C’ 44. 


6. DERIVATION OF EFFECTIVE ELASTIC CONSTANTS FROM 
STRESS MATRIX METHOD 


In the foregoing sections we have indicated the derivation of the effective 
elastic energy and the effective constants from the energy method. It is 
also possible to derive the same from the stress matrix and this method has 
been adopted by some workers (Birch, 1947; Hughes and Kelly, 1953). The 
method is not simpler, since in any case, the energy function has to be 
evaluated as an essential step for the derivation of the stress from the Murna- 
ghan formula T = (pz/pq) J (34/3) J*. We form the appropriate matrices 
for an (m + 8) transformation, and after the product matrix of all the three 
matrices J, 3¢/3n, J* is worked out and the same is multiplied by pz/pg, we 
retain only the constant and terms linear in the 3’s. This gives the total 
stress appropriate to an 7+ 6 deformation. From this, the extra stress 
appropriate to the infinitesimal deformation has to be found out. A direct 
subtraction like T,,; — T, to obtain the extra stress is incorrect, since the 
stress, being referred to the final state, has different reference conditions for 
T,.9 and T,, (which is really the pressure P in our present case). In this 
respect the position is more complicated than in the energy method. Whereas 
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¢(m + 8) — (my) employed in our energy method is an allowed subtraction 
as both refer to the common condition of a state of zero stress, Tos — Ty 
in the stress matrix method is not an allowed subtraction in view of the 
difference in the reference frames. We have to refer T,,, and T,, to common 
reference conditions and then deduct the effect of the pressure. Birch cal- 
culated the total stress T,, = T, and equated it to an expression — P + c’,,8, 
+ c's. (5, + 53), wherein he identified c’,, c’y,, etc., with the effective elastic 
constants. However, they would be so only if T,, + P(it has to appear 
in this form as T,, is negative while P is kept positive) represents the extra ] 
pressure. But as pointed out earlier, the removal of the pressure term in 
this manner from T,; is not justified. Likewise he calculated T,, and on 
the basis that there is no initial shearing stress he identified T,, with 2C' 445¢: 
Here again, he has not taken into account the fact that a pressure P in an ( 
n State is equivalent to a stress tensor of a somewhat different type when 
referred to the 7 + 4 state; the representation of the pressure in the » + 4 
state has in fact a component —Pé6, in the T, = T,, term, and —P(1 — 4, 
— §,) in the T,, term as will be shown in the next paragraph. So, as long 
as the Murnaghan formule for stress, which refer everything to the final I 
state, are employed, the pressure should also be referred to those final con- 
ditions, and to obtain the excess stress we have to subtract — P(1 — 5, — 8) 

from T,, and similarly subtract — Pd, from Tj,. As the correction does 

not involve §,, c’;,; is unaffected and it follows that c’,, as derived by us 

should be identical with that obtained by Birch. As the 5, term in T,, now 

comes with an additional correction —P6,, the value of c’,, obtained by a 
Birch will have to be corrected by subtracting P. Likewise, c’,, obtained 

by him should be corrected by the addition of P/2. We give below the 

values obtained by Birch which were developed up to the first power of 7 S 
along with our values up to the same power, although in the actual working 
given by us earlier, we have developed up to 7?. 


Values of Birch Values obtained by authors 

Cn = Cy + +4C = Cy +9 +4C yy) 

C , 

cul = Cut + Cu + + = Coa tn + + + . 
+ Ciss + Ciss fr 
— (ey + 2¢i2) — P= (ey + ti 
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The value of P when referred to the final 7 + 5 conditions can also be 
obtained by using the formula given by Murnaghan T¢dS* = TdS* where 
T® is the stress in an initial frame with a surface area element given by dS?, 
T the stress in the final state with an element of area d§*. The relation 
between dS* and dS® is 


dS* = (J*)— dS@. (37) 
Hence we get 


= 
Px 


or 
T = PE (38) 
Pa 
In our case, the 7 state is the initial state and the » + 4 state is the final state. 
Hence 
1+ 8, 55 
35 & 
and 
Pe 


Substituting these values, we find that the stress — P in the » state appears, 
in the 7 + 4 state, as the stress given by the matrix 


— Pb, —Pé, 


Hence these matrix elements have to be subtracted from Tj), Ty, etc., before 
the correct extra stress is obtained. 


Instead of referring P to the final » + 5 state and removing its effect 
from Ti, etc., one could also refer the total stresses T,;, etc., to 
the » state and then remove P directly. This will not involve any modifica- 
tion of the pressure, whereas the stresses, since they are referred to the 
» state as the initial state, can be deduced from the simpler formula T¢ = 
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J (26/2). Here is what Murnaghan (1951) calls the modified stress 
matrix. It can be verified that results identical with the above are obtainable 
by such a procedure as well. 


S. BHAGAVANTAM AND E. V. CHELAM 


7. DISCUSSION OF RESULTS 


Experimental data concerning the third order constants are very meagre. 
Whatever is available [Hearmon (1953), Hughes and Kelly (1953)] suggests 
that all these constants are negative and an order of magnitude larger than 
the second order constants. In the discussion given by Hearmon (1953), 
the data obtained by Lazarus (1949) concerning c’j, in respect of the cubic 
crystals KCl, NaCl, CuZn, up to hydrostatic pressures of 10,000 bars were 
considered. Hearmon plotted the ratio c’j,/cjy, against » and obtained 
a nearly straight line relationship. Using the values of the slopes of these 
lines, he derived values for the 3 third order combinations Cg, Cp, Cg given 
by 


Ca = + Co = + Ca = $C + 
In NaCl, for example, Cg was — 100, Cy was —14 and Cg was — 11, in 
units of 10% dynes per sq.cm. The corresponding values of the second 
order elastic constants are roughly 4-9, cy =1-2, cyy=1-2. In 
view of the method employed, Hearmon has cautioned that the accuracy 
of these values is not likely to be high. However, the order of magnitude 
of the third order constants suggested by these calculations seems to be 
correct, and further work of Hughes and Kelly (1953) supports this view. 


As the readings were taken in a portion of the (c’jx/cix, ») graph which 
was nearly straight, it is clear that, in this region the 7 term in the formule 
(33) could be neglected. For the same reason, there would not be any 
appreciable difference whether Birch’s expressions of the effective elastic 
constants, or those derived by us from the energy expression method are 
used. When, however, we use the values of the third order constants as 
derived above for discussing the behaviour of the energy function for large 
yn, the »? term, as also the effective elastic constants as derived by us from 
the energy function will have to be used. 


Thus, neglecting the second order constants in comparison with the 
third order ones, we find from (33) that each of the effective elastic constants 
can be represented as, 


Cik = Cik + An + By? (40) 
where A equals Cg, Cp or Cg respectively as defined earlier, and B is nearly 
equal to 2A in case of c’;, and 3/2 A in case of c’,;. and more than 3/2 A in 


( 
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case of c’,4,. A is ordinarily negative and as 7 itself is negative for compres- 
sion, the formula shows that the elastic constants should, as a rule, increase 
with increasing compression, so long as 7 is not too large. Such an increase 
is to be expected, since these elastic constants express the magnitude of 
energy required for deforming a substance which is already in a compressed 
state. As the compression increases, the energy required for compressing 
further should naturally increase indicating increasing resistance of the 
substance to further deformation. What is interesting, however, is that 
the formule (33) and (40) indicate the possibility that the elastic constants 
might increase up to a point only and then start decreasing. That this will 
happen, other factors remaining unchanged, when | + 3n = 0 or 7» = —0°3, 
in the case of c,. and when 7 = — 0:25 in the case of c,, and less than 
— 0-3 in the case of c’4, may be inferred by equating (dc’;,/dn) to zero. 
However 7 = — 0:25 to — 0:3 corresponds to a very large strain difficult 
of realisation in practice. Since a decrease of the elastic constants with 
increasing pressure is itself indicative of anomalous behaviour, we have to 
conclude that matter should be expected to behave in an abnormal manner 
when the magnitude of compression becomes as high as 0-25 to 0°3. 


This of course is a purely qualitative conclusion, and the magnitude 
might well be somewhat different when we develop c’j, to include 7* terms 
also. Other physical conditions such as the temperature are bound to play 
an important part and these have not been considered. 


8. EFFECTIVE ELASTIC ENERGY IN NORMAL CO-ORDINATES 


We note that the function ¢¢ given by (32) is separable in the variables 
54, 5;, 5g but not in 5,, 5,, 53 because of cross-products. To discuss the 
behaviour of this function, it is necessary to make a ‘normal co-ordinate 
transformation’ so that each variable could be treated independently. We 
introduce new orthogonal variables given by 


6, = (8s + 8: + 8s) 


= 5 (81 — 8s) 


9s (8; + 253). 


In terms of these, we rewrite (32) as 
, 3 , 3 
be = (Cn + (Cn ey) 


“+ 2c" (54? + 55? + 8°). 
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The system will therefore become unstable when any of the coefficients 
+ — OF C’44 becomes negative. Of these, the first is only 
hypothetical, because c’,, and c’,, being initially positive, the term c’;; — C’j. 
will, if at all, become negative long before c,,' + 2c,,’ does so and we may 
safely exclude the first case. As regards the other two coefficients 4(c’,, 
— C'y2) and c’44, we notice that both are identical in the isotropic case. 
For such a case, we may take the criterion that as soon as c’,, tends to assume 
negative values, instability sets in. With the present meagre available data, 
it is not possible to calculate the magnitude of the compression at which 
this phenomenon sets in for any particular case like the interior of the earth. 
However, the form of c’,, especially in its expression as b4, — P/2 suggests 
that with monotonously increasing pressure, the possibility of its becoming 
negative at some stage of the compression cannot be ruled out. This is 
especially so in substances in which cy, goes on decreasing with pressure 
such as in the case of KCI even from the very beginning (Hearmon, 1953). 
Even if c4, increases at first and decreases later due to the importance of 
the »? term, or decreases due to other causes such as the effect of tempera- 
ture, a stage will be reached when c’y4 becomes negative. It is obvious 
that the substance then becomes unstable under the action of shearing stresses 
and this may be interpreted by saying that it can no longer sustain shear 
waves. 


A more detailed discussion of the instability problem, and the exten- 
sion of the present work to include a finite initial stress of any type, will be 
taken up in subsequent papers. 


9. SUMMARY 


In order to study the elastic behaviour of matter when subjected to 
very large pressures, such as occur for example in the interior of the earth, 
and to provide an explanation for phenomena like earthquakes, it is essen- 
tial to be able to calculate the values of the elastic constants of a substance 
under a state of large initial stress in terms of the elastic constants of a 
natural or stress-free state. An attempt has been made in this paper to derive 
expressions for these quantities for a substance of cubic symmetry on the 
basis of non-linear theory of elasticity and including up to cubic powers 
of the strain components in the strain energy function. A simple method 
of deriving them directly from the energy function itself has been indicated 
for any general case and the same has been applied to the case of hydro- 
static co~pression. The notion of an effective elastic energy—the energy 
require to effect an infinitesimal deformation over a state of finite strain— 
has been introduced, the coefficients in this expression being the effective 
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elastic constants. A separation of this effective energy function into normal 
co-ordinates has been given for the particular case of cubic symmetry and 
it has been pointed out, that when any of such coefficients in this normal 
form becomes negative, elastic instability will set in, with associated release 
of energy. 


ACKNOWLEDGEMENT 


One of us (E. V.C.) is indebted to the Government of India in the 
Ministry of Communications and to the Director-General of Observatories, 
New Delhi, for kind grant of study leave, which has rendered it possible 
for him to take up the present work. 


REFERENCES 
1. Bhagavantam S., and Acta Cryst., 1949, 2, 21. 
Suryanarayana, D. 
2. Bhagavantam, S. .. Third Order Elasticity—Proceedings of the Third Congress on 
Theoretical and Applied Mechanics, Bangalore, 1957. 
3. Birch, F. .. Phys. Rev., 1947, 71, 809. 
4, .. J. Geo, Research, 1952, 57(2), 227. 
5. Fumi, F. G. .. Phys. Rev., 1951, 83, 1274. 
6. Green, A. E. and Zerna,W. Theoretical Elasticity, Oxford University Press, 1954. 
7. Hearmon, R. F. S. .. Acta Cryst., 1953, 6, 331. 
8. Hughes, D. S. and Phys. Rev., 1953, 92, 1145. 
Kelly, J. L. 
9. Lazarus, D. .. Ibid., 1949, 76, 545. 
10. Murnaghan, F, D. «+ Finite Deformation of an Elastic Solid. John Wiley and 


Sons, New York, 1951. 


y 
12 
y 
e 
i, 
h 
1. 
is 
is 
e 
). 
of 
1- 
1S 
ir 
l= 
ye 
1, 
1- 
a 
e 
rs 
d 
d 
)- 


THE RAMAN SPECTRA OF POLYCRYSTALLINE 
SOLIDS 


By D. KRISHNAMURTI, F.A.Sc. 
(Memoir No. 123 from the Raman Research Institute, Bangalore—6) 


Received June 23, 1960 


1. INTRODUCTION 


CRYSTALLINE solids as found in nature or prepared artificially usually appear 
as polycrystalline masses, large and well-developed crystals being the exception 
rather than the rule. In the study of the Raman effect, it is often necessary 
to make use of such material. The internal reflections within the solid and 
the parasitic illumination resulting therefrom create difficulties in the success- 
ful recording of the spectra. These difficulties are enormously enhanced 
when, as is often the case with artificially prepared material, the solid appears 
in the form of very fine particles which are non-adherent or only loosely adherent 
to each other. The special techniques necessary to obtain any useful results 
with such material have been the subject of numerous investigations in the 
past, reference to which will be made later in the paper. 


In the present author’s studies on the Raman spectra of crystals, exten- 
sive use has been made by him of the A 2536-5 resonance radiation of the 
mercury arc and of the special technique originally proposed by Rasetti in 
that connection. It appeared to him therefore to be of interest to explore 
the possibility of using the same technique for satisfactorily recording the 
Raman spectra both of polycrystalline solids and of the finest crystal powders. 
The present paper is of a preliminary nature. It discusses the problems 
which arise in this connection and presents the results obtained in a few typical 
cases of interest. 


2. A SURVEY OF THE PROBLEM 


The theory of the propagation of light in polycrystalline media developed 
by C. V. Raman and K. S. Viswanathan (1955) and discussed by C. V. Raman 
(1957) with reference to the facts of observation provides an insight into 
the behaviour of these substances with regard also to the diffusion of light 
in them, since the properties in relation to transmission and diffusion of light 
are complementary to each other. The following points are germane to our 
discussion here on the nature and origin of the parasitic scattering and how 
20 
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best it can be minimized. When the sizes of the crystallites are large com- 
pared. with the wavelength ot light and the birefringence of the solid is appre- 
ciable, the effects arising may be considered from the standpoint of geometri- 
cal optics as reflection and refraction of the incident light at the intercrystalline 
boundaries. Since for a finite volume of the sample the total area of the 
boundaries which scatter increases with diminishing particle size, the para- 
sitic scattering of light would naturally be greater in the case of finer particles, 
and a very small thickness of the medium suffices therefore to diffuse the 
incident light totally in directions backward. However, as the size of the 
crystallites diminishes further and becomes comparable to that of the wave- 
length of light the effects arising have to be considered from the point of 
view of wave-optics. In the case of a weakly birefringent solid the diminution 
in the particle size at such a stage should result in a diminution in the scattering 
and hence correspondingly increased transmission. 

The foregoing remarks have a bearing on the case of fine crystal powders, 
Since the intensity of the diffusion of light at the boundaries of the crystallites 
depends on the refractive index of the substance with reference to the medium 
surrounding it, the immersion of the powder sample in a medium of nearly 
equal index should considerably reduce the parasitic scattering. The technique 
of forming pellets of the substance diluted in a solid medium of nearly equal 
index as is done for infra-red studies may also be effective in reducing the 
parasitic scattering. Finally, the use of particles of colloidal dimensions 
should also be helpful in such studies. 

We shall, in the following, consider the effects arising from the parasitic 
illumination and how the earlier investigators tried to minimize it. The 
parasitic scattering besides causing an intense halation close to the exciting 
line produces a general fog over the entire region of the photographic plate, 
the latter effect arising from the multiple reflections of the light by the optics 
within the spectrograph. In addition, the continuous spectrum that is 
invariably present even in low pressure arcs is also recorded strongly. Asa 
result, not only does it become impossible to observe any Raman lines in the 
vicinity of the exciting radiation, but also weak Raman lines of even high- 
frequency shifts are masked by the general background of illumination. 
Apart from all this, the imperfect transparency of the specimens themselves 
(resulting from their polycrystalline nature) causes only a very small depth 
of the substance to be effectively operative for observing the Raman scattering. 

One of the important advances made in the study of the Raman spectra 
of crystal powders is due to R. Ananthakrishnan (1937) who developed what 
is known as the technique of complementary filters. A concentrated solution 
of iodine in carbon tetrachloride was interposed in the path of the incident 
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radiation to absorb all the visible mercury lines excepting that at A 4046. 
The scattered light was filtered through a strong solution of sodium nitrite 
to absorb the exciting radiation 4 4046. By this procedure he was able to 
obtain fairly clear spectra exhibiting the high-frequency Raman lines ; however, 
since the absorption of the NaNO, solution extended over a finite region 
on either side of A 4046, the low-frequency lines could not be recorded. A 
similar technique was also developed by him for use with A 4358 excitation. 
Another technique used with success was due to Billroth, Kohlrausch and 
Reitz (1937) who constructed a special spectrograph consisting of two parts, 
each with a prism and two lenses and with a common slit between them. 
The light was incident on a plate on which was spread the crystal powder. 
This plate was set at such an angle so that its position was conjugate to the 
position of the photographic plate on which the lines were recorded. In 
effect, the lines of different wavelengths recorded on the photographic plate 
emanate from different corresponding points on the former plate on which 
the powder is deposited. By screening off the point on the plate from which 
the radiations A 4358 emanate, the overexposure arising due to this line was 
greatly reduced. Apart from such methods, absorption and interference 
filters have also been used to monochromatize the incident radiation and for 
removing the continuous background close to the exciting line. P. Krishna- 
murti (1930) used low-pressure arcs which had only a weak continuum and 
removed even this by a filter of didymium chloride solution. J. Brandmuller 
(1953) and M. C. Tobin (1959) have employed interference filters for mono- 
chromatizing the incident readiation, as also for suppressing the exciting 
radiation attendant with the Raman scattering. 


Though with the aid of the above techniques it is possible to record the 
strong lines in the high-frequency region of the spectrum, the recording of the 
lattice frequencies which are of much interest in the case of studies on crystals 
is still a difficult problem. This is due to the fact that the absorption character- 
istics of all known filters extend over a finite range on either side of the 
exciting radiation that they are intended to suppress. On the other hand, 
the use of interference filters require that parallel light be employed, with the 
consequence that the scattered beam that can be brought to a focus on the 
slit of the spectrograph is of considerably reduced intensity. In addition, 
the low scattering power in the visible region of the spectrum coupled with 
the fact that only a small depth of the substance is effectively available for 
observation of the Raman scattering makes it very difficult to record weak 
Raman lines. It is in these respects that the technique of excitation of the 
Raman effect with the a 2536-5 radiation, which we shall discuss in the follow- 
ing section, proves to be advantageous, 
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3. THE RASETTI TECHNIQUE 


The details of this technique which have been fully described in an earlier 
paper by the author (1957) consist mainly of (i) the use of the intense A 2536-5 
resonance radiation emitted by a water-cooled magnet-cotrolled quartz. 
mercury arc and (ii) the suppression of the parasitic illumination of A 2536-5 
radiation attendant with the Raman scattering by a filter of mercury vapour 
inside the spectrograph. The principal advantages in the technique arise 
from (i) the exceptional intensity of the exciting line compared to the neigh- 
bouring lines of the spectrum, (ii) the increased scattering power in the ultra- 
violet region of the spectrum and (iii) the successful elimination of the exciting 
radiation from the scattered light, thereby enabling the observation of the 
lattice lines appearing close to it. The power of the technique can be gauged 
by comparing the results of two studies by the author (1956, 1957) on magne- 
site and calcite. Whereas with the former and using the visible radiations of 
the mercury arc, only five lines could be observed with difficulty, in the latter 
no less than thirty-three frequency shifts could be clearly recorded. Apart 
from studies with single crystals, this technique has also been employed by the 
author (1959) to record the Raman spectra of several solid hydroxides, most 
of which were available either in the form of polycrystalline masses or as fine 
powders. It had been noticed that, though the strong Raman lines could 
be easily recorded, the continuous spectrum accompanying the exciting radia- 
tion impedes the observation of weaker Raman lines. Monochromatization 
of the incident beam, it is hoped, would help in the recording of the weaker 
lines also. 


4. THE PRESENT STuDY: ITS RESULTS AND DISCUSSION 


In the present study, we have considered two classes of substances, viz., 
fine powders and compact polycrystalline solids. While in the former the 
crystallites are only loosely adherent to each other, in the latter the different 
microcrystalline domains of the substance are either linked together by atomic 
and intermolecular forces or held together by a cementing medium to form 
a compact solid. Besides the general interest attached to studies on the 
Raman spectra of such materials, there arise the important questions whether 
their Raman spectra would exhibit any variations depending on the size of 
the crystallites, and if so how small they have to be before such effects can 
be observed. 


Calcite was chosen as a typical case for the exploratory studies on fine’ 
powders since its Raman spectrum is very well known. Two samples of 
CaCO, were used to record the spectra. One of them was the readily available 
commercial powder which contained aragonite as well, The method of 
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sedimentation was used both to obtain a powder of uniform particle size and 
also to remove the aragonite—which being denser than calcite settles at a fast 
rate. The other sample used was calcite precipitated in the cold by the addition 
of solutions of CaCl, and K,CO;. The powder was deposited thinly on the 
inside of a glass bulb which had two windows cut out for incident and scattered 
radiations. The spherical form of the bulb served to effectively utilize the 
exciting radiation by multiple reflections inside the container. The size of 
the particles of the powder used to record the spectrum was about | w and 
was estimated under a high-power microscope. The spherical bulb was 
kept jammed against a water-cooled magnet-controlled quartz mercury arc 
intensely emitting the A 2536-5 resonance radiation and the scattered light 
was focussed on the slit of the spectrograph by a quartz condenser. 


Plate II accompanying this paper reproduces the microphotometer 
record of the Raman spectrum. The frequency shifts observed therein, 
viz., 1086, 282 and 155 cm.~! are fairly sharp and are identical with the fre- 
quency shifts recorded with large crystals of calcite. The spectrum does 
not exhibit any alteration either in the frequency shifts or their widths. Be- 
sides the above three lines, it should have been possible to record two other 
weaker lines at 712 and 1436 cm.-! but for the fact of the continuous spectrum 
of the arc masking them. 


As typical cases of polycrystalline solids, we have chosen alabaster 
wherein the crystallites are randomly orientated, and fascicular or columnar 
gypsum consisting of crystallites in the form of rods (the b-axes of the crystal- 
lites being the axes of the rods) arranged parallel to each other. The X-ray 
diffraction pattern of the specimen of alabaster used in the present study had 
been reported earlier by C. V. Raman and A. Jayaraman (1954) and exhibits 
spotty rings indicating the random orientation of the crystallites in it, the 
particle sizes being of the order of 10». In the case of fascicular gypsum 
the rods though of minute dimensions are easily visible. The X-ray diffrac- 
tion patterns recorded for it show that whereas the b-axes are all parallel 
to each other, the c and a axes exhibit a limited range of variation. Owing 
to the low birefringence of gypsum, both the specimens were translucent 
and their spectra could therefore be recorded without much difficulty. The 
spectrum of a single crystal of selenite in the form of a rectangular tablet was 
also recorded for comparison. The exciting radiation was incident on the 
broad face of the crystal and the scattering observed along an edge (perpendi- 
cular to the b-axis). In the case of fascicular gypsum the exciting radiation 
was incident parallel to the b-axis and the scattering observed perpendicular 
thereto, 
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Figures (5), (c) and (d) in Plate I accompanying this paper reproduce the 
Raman spectra of selenite, fascicular gypsum and alabaster recorded with 
adequate intensity. Figures (a) and (e) show the spectrum of the mercury arc 
for comparison. It is evident that the spectra recorded in the three cases 
exhibit no differences with regard to either the positions or the widths of the 
lattice and internal frequency shifts. The only noticeable feature is a variation 
in the relative intensities of the two intense water-bands ; this feature is a 
well-known orientational effect arising from the fact of the water molecules 
in gypsum being arranged in layers perpendicular to the b-axis (vide, e.g., 
Roop Kishore, 1942). Finally, it should be remarked that with the same 
specimens it would have been impossible to obtain such clear spectra exhibit- 
ing the lattice lines with the visible radiations of the mercury arc as exciters of 
the Raman effect. 


That no differences arise from the polycrystalline nature of the solids 
as well as from the finely divided form of the powders is consistent with the 
fact that the ranges of interatomic and intermolecular forces in crystals do 
not extend beyond a few lattice spacings. The individual crystallites in the 
polycrystalline samples and the fine powder used in the present study are 
still too large—each containing an enormous number of unit cells, of the 
order of 107®. Since the number of lattice cells located well within the crystal- 
lites is large compared with that of the number of cells near the surface, no 
significant effects such as an alteration in the value of the frequency shifts, 
their intensities or widths can be expected to arise. 


5. CONCLUDING REMARKS 


The present studies have clearly shown that the Rasetti technique can 
be successfully employed to investigate the Raman spectra of polycrystalline 
masses and fine powders. However, particularly in the case of fine powders 
the continuous spectrum in the arc masks the weaker Raman lines. It is hoped 
that the parasitic illumination can be reduced considerably by the following 
techniques, viz., (i) monocnromatization of the incident beam, (ii) immersion 
of the powder in a liquid of nearly equal refractive index, (iii) by dilution 
of the sample in a substance of suitable refractive index and forming tablets 
of the substance under high pressure and (iv) use of powders of colloidal 
dimensions and dispersed as in method (ii) or (iii). 


In conclusion, the author wishes to express his sincere thanks to Professor 
Sir C. V. Raman, F.R.S., N.L., for his keen interest in this investigation 
and for the helpful discussions he had with him, 
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6. SUMMARY 


The paper discusses the experimental techniques with regard to the 
recording of the Raman spectra of polycrystalline solids. The present studies 
on the typical cases of polycrystalline gypsum, and calcite in the form of a 
finely divided powder show that the Rasetti technique can be successfully 
employed to obtain clear spectra of such substances and also indicate the 
scope for further improvements in the techniques. It emerges that the poly- 
crystalline nature of the gypsum and the finely divided form of the powder 
studied have no observable influence on their spectra. 
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Fic. 2. The microphotometer record of the Raman spectrum of a fine powder of calcite (of particle 
size 1 x). 
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At the present time our knowledge on the structure of the nucleon-A hyperon 
interaction is quite poor since there are no direct measurements on N — A 
scattering. That they interact strongly is inferred from the large production 
cross-section of strange particles and from the existence of hyperfragments 
which contain a bound A hyperon in a complex nucleus. There are several 
careful measurements on the properties of hyperfragments, especially on the 
binding energies'; hence it would be desirable to develop a systematic method 
of analysis which would relate the binding energies to the matrix elements of 
the A — N interaction. 


Meson theory suggests an extremely complex structure for this inter- 
action. Since the A hyperon has spin 4, the A — N forces can be analysed 
in terms of triplet and singlet central forces, the spin orbit force, the tensor 
force and the quadratic spin-orbit force as in the n — p system. But since 
the A hyperon is an isotopic singlet, Yukawa processes involving single pion 
emission by a A hyperon are not allowed ; consequently the asymptotic 
one-pion “ tail ’’ is absent here and the range of the A — N forces is expected 
to be considerably shorter. Now the two-pion contribution to the potential 
is naturally more complicated and cannot be clearly separated from the one- 
kaon contribution. Also, at these short ranges it is no longer legitimate to 
treat the “‘ actual’’ potential in terms of a A — N system alone, since the 
reaction A + +N is possible in addition to 4+N—-+A+WN and 
hence the interaction is better described in terms of a “ potential matrix ” 
for the hyperon-nucleon system ; and the off-diagonal terms of this potential 
matrix have a long range associated with a one-pion tail. 


In contrast to this complexity the properties of hyperfragments would 
depend only on some average properties of the interaction ; a parallel instance 
is the dependence of low-energy scattering only on zero-energy scattering 
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length and effective range. Of course, one may attempt? to solve the Schré- 
dinger equation for the many-particle system using an associated interaction 
structure with unknown parameters, but this method requires for its practical 
implementation, some approximation technique like the variational method ; 
and even then, except for the simplest systems, to make the calculation practi- 
cable one has to invoke special models. We havea parallel instance in the 
theory of complex nuclei, but here the shell model has been a good first 
approximation to nuclear structure, at least as far as nuclear low-lying levels 
are concerned. Since one has to use a model anyway, it becomes interesting 
to see if it can be patterned after the theory of complex nuclei. 


We have investigated this problem*»* and developed in detail a model 
for hyperfragments. The physical postulates are that the system is well 
represented by a suitably antisymmetrized product of one-particle wave 
functions with two-particle interactions. We also make the assumption of 
j — j coupling so that the one-particle wave functions may be classified as 
Sij2, P32, etc. The hyperon is always in the §j,, state at least for 
the low-lying states ; and all hyperfragments that have so far been identified 
correspond to nuclei belonging to the S,,. and P3,. shells. The two-particle 
interactions are assumed to lead to negligible configuration mixing so that 
the low-lying levels are almost pure configurations ; this assumption is 
certainly more accurate for the energy eigen values than for the wave functions 
themselves. In the absence of inter-particle forces the ground state of the 
model is many fold degenerate, but the interactions split this ; and the low- 
lying levels of the model with interactions are all obtained from this level ; 
they all hence correspond to the same set of one-particle configurations coupled 
in different fashions. 


Having specified the model, one could proceed to calculate the energy 
levels in terms of the two-body interaction parameters. Since the nucleon 
binding energies are known, we shall take advantage of this to subtract out 
the ground state energy of the nucleon group ; since the A — N interaction 
is somewhat weaker than the N — N interaction for the first excited states 
of the hyperfragments, the nucleon group is in its ground state. In any case 
this is true of the ground state ; and for S-shell nuclei there are no bound 
excited states anyway. This restriction to hyperfragment states with unexcited 
nucleon groups permit us to label these states simply by the two spins which 
in general have two values corresponding to the parallel and the antiparallel 
spin orientations of the A and the spin of the nucleon group. 


In the actual calculations, the so-called ‘‘ Projection Theorem ’’s 
furnishes a powerful tool, in that it relates in our model, the matrix 
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elements of two-particle operators like the interaction energy for the many- 
particle wave functions in terms of the simpler matrix elements for two- 
particle configurations and coefficient of fractional parentage. The method 
is now useful in those cases where the expression in terms of the fractional 
parentage coefficients have a summable form so that closed expressions result. 
Fortunately all the hyperfragments that we discuss have this structure and 
so one can work with these closed expressions. The details and derivations 
may be found elsewhere. ; 


There are four S-shell hyperfragments which have been identified ,He%, 
zH4, ,He*, ,He®. The first and last are isotopic singlets and the other two 
form a doublet ; they correspond to the deuteron, the triton, He* and the 
alpha particle. The experimental! values for the binding energies are respec- 
tively 0-12 Mev., 2-20 Mev., 2:36 Mev. and 3-08 Mev. There are three 
parameters*® occurring in the expression for the A-binding energy, namely, 
the one-particle energy of the hyperon (T,,) and the singlet and triplet matrix 
elements (Sy, S,) of the (charge-independent) N — A interaction for the 
configuration The possible spin assignments and theoretical expres- 
sions for the binding energies in terms of these parameters deduced from the 
binding energies for the antiparallel spins are T, = 8-28 Mev., Sp = — 4-88 
Mev., S,; = — 2:16 Mev. While charge independence demands that ,H* 
and ,He* shoyld give the same binding energy, experiments give different 
values agreeing within experimental error ; in computing the parameters 
for the model, we have averaged over these two values. For the parallel 
spin configuration the model parameters cannot be found since one notices 
from Table I that the expressions for the binding energies form a degenerate 
set which is inconsistent with the experimental values. This data thus indi- 
cates antiparallel configuration of the hyperon spin and the spin of the 
nucleon group and in particular demands that ,He* has spin 0. This last 
result is particularly important since the large rate of the -reaction 
K- + ,H* + then argues for a pseudoscalar kaon.* This large 
rate makes it irrelevant to know if there is a loosely bound excited state® of 
spin | for ,He*; but from the parameters deduced above and the expressions 
for the binding energies, according to Table I, one finds that there are no bound 
excited states for any of the S-shell hyperfragments. One also notices that 
the A — N interaction is definitely spin-dependent and attractive and the 
ratio S,/S,~2 is consistent with the previous estimates by Dalitz and Downs.’ 
It turns out that the systems mnA and ,He® are unbound. 


A similar analysis can be carried through for P-shell hyperfragments. 
There are thirteen hyper nuclides belonging to the P3,.-shell and among them 
there are a sufficiently large number of events’ on four (,Li’, ,Li®, ,Be®, 
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TABLE | 


S-Shell A- binding energy in terms of two-particle interaction parameters 
and the hyperfragment spin assignments 


(A =3, 4, 5) 


Coefficients of 


Hyperfragment 


,H? } (antiparallel) 


3/2 (parallel) 


al’, ,He* 0 (antiparallel) 
1 (parallel) 


4 


aHe5 


,Li®) and the parameters P,’, P,’, (defining the A — N interaction for the 
configuration S,,., Ps,. for states with J = 2, | respectively) are deduced from 
a least-squares fit to those. In this case it is advantageous to subtract the 
binding energy of the ,He® hyperfragment so that the one-particle contri- 
bution disappears. The analysis is too lengthy to be reproduced here but the 
comparison between theory and experiment for the binding energies is given 
in Table Il. Using these parameters one can predict the first excited states 
of these hyperfragments and the binding energies of the other P3,9-shell 
hyperfragments. A comparison of the latter prediction with experiment 
is given in Table IIf. One notices that while the fit is satisfactory, it is difficult 
to differentiate between the parallel and the anti-parallel configurations on 
the basis of binding energy data alone. 


The same model may be applied to calculate other properties; for example 
one may develop a projection formula for the magnetic moments of hyper- 
fragments,* but these do not appear to be amenable to immediate measurement. 
More useful would be a correlation of the decay dynamics. For mesic decay 
modes, especially in the lighter hyperfragments, pion rescattering corrections 
may be ignored in a first approximation; and it is then possible to write the 
weak decay interaction as a two-particle operator with the pion wave function 
treated as an “external” field ; this calculation is quite analogous to the 
treatment of radiation processes in atomic systems to lowest order. 


J 

S, S, 
1 0 2 
3/2 3/2 

1 5/2 
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TABLE II 


Comparison between theory and experiment for the binding energies of Psy. 
shell hyperfragments (all energies given in Mev.) 


**Reduced” Theory 
Hyperfragment Binding 
Energy Anti-parallel spins Parallel spins 


—2+38 —2-18 —2+10 
—3-03 —3-16 —3+14 
Be® —3-52 —3-49 —3-53 
—4-12 —4-14 —4-19 


TABLE III 


Predicted and measured binding energies for rare P-shell hyper fragments 


Theory 
Antiparallel Parallel 


Hyperfragment Experiment 


3-0 4:73 4:84 
6-6 6-24 6-22 
9-9 9-18 9-37 
9-8 9°73 9-75 
10-8 - 10-10 10-10 


For non-mesic decays also one can (to a poorer approximation) use a two- 
particle decay operator. These calculations are now being completed, but 
our object in mentioning them is to point out that these are tests of the model 
independent of binding energy determinations, though there is no guarantee 
that the model should be adequate for decay processes ; again, the situation 
as far as beta and gamma transition matrix elements in complex nuclei fur- 
nishes a parallel. 
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One might at this point raise the question of the validity of a shell model 
with two-particle interactions only,® even as a first approximation in such 
a region of low mass numbers ; one may also point out that in the case of the 
A —N interaction, the three body forces N—A—N brought about by 
the exchange of a pion with each of the nucleons may be important.?. With 
regard to these points one has to remember that in setting up the model one 
has tacitly assumed the existence of sufficiently bound configuration of the 
particles so that the possibility of finding a particle far from the rest of the 
nucleus is fairly small ; and a substantial part of the many-body forces (which 
are equally well present in ordinary nuclei) is already used up in setting up 
these one-particle eigen functions. In this sense, we are working not so 
much with two-particle interactions but rather with a two-particle correlation 
approximation. It is perhaps unfortunate that we are thus deprived of 
knowing the exact two-particle interaction, but then the interaction parameters 
that we find are the combinations relevant for the structure of the many-body 
system. The general effect of many-particle correlations is to prevent the 
two-particle interaction parameters from having any significant dependence 
on the two-body potential and has thus the same qualitative nature as the 
Pauli principle and the “‘ hard” core ; in view of these we believe that the 
present model is a fair representation of the physical situation and that the 
interaction parameters are suitable averages of the true two-particle interaction 
in the neighbourhood of these average separation. We hope to develop 
these considerations in a more quantitative fashion so as to make more speci- 
fic inferences about the A — N potential. 


However, a test of the degree of validity of the model for hyperfragments 
is suggested®: one could see how good is a similar model for the corresponding 
S and P shell nuclei. (Usually this region of atomic numbers is consi- 
dered ‘‘ off limits” for the shell-model!) We have three binding energies 
tor the S-shell: the deuteron, triton He* and alpha particle. With a notation 
similar to that used for hyperfragments, we have 


2Tx + S,’ 2:23 Mev. 


4Ty + 3S,’ + 3S)’ = — 28-30 Mev. 


From which we get Ty = + 5-67 Mev., S;’ = — 13-56 Mev. and S,’ = — 3-43 
Mev. Notice that the one-particle energy is smaller here (corresponding to 
a much stronger binding) as compared with the value obtained for T,. If 
we now try to predict the virtual singlet level of the n — p system we find 
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that its ‘* excitation energy ” is far too high ; it then appears that the model 
fails here. But the reason for the failure is easily found; in contrast to the 
triton and the alpha particle the deuteron is a rather ‘“‘ open” structure. 
The wave functions are spread out much farther than the one-particle wave 
functions appropriate to the S-shell. The residual two-particle interaction 
has of course the one-pion tail in this case and the spread out wave functions 
exploit this attractive tail and get further bound. If this explanation is 
physical, then the relation between the interaction parameters and the one- 
particle energy for the deuteron should contain a smaller energy ; if we put 
for this binding energy the value zero, we get an altered set of parameters 
and these bring the predicted singlet level nearer zero energy. It is still too 
high, but this must also happen since the unbound singlet level should obtain 
more “ extra binding”. Similar studies have been made for the P3,. and shell 
nuclei also.® 1° And the fit to binding energies is to better than two per cent. 
The lesson of these studies is that for ‘“‘ compact ”’ (sufficiently bound) nuclei 
the model is adequate. 


This analysis now raises the question to what extent our predictions for 
the hyperfragments is modified by these ‘‘ extraordinary” corrections ; this 
is especially important since ,H® is only loosely bound and one might see 


if these corrections could remove the inconsistency for one parallel spin con- 
figurations. One verifies immediately that for an attractive tail the corrections 
go in the wrong direction (since for consistency we need the binding energy 
of about 0-5 Mev.); on the other hand, a repulsive tail behaves in a very 
different manner from an attractive tail and is relatively unimportant for a 
one-particle wave function which was sufficiently compact to start with. 
For the P-shell hyperfragments all of the binding energies are considerably 
larger. It is to be emphasized that the corrections are qualitatively expected 
to be much less important since no long range tail for the A —N interaction 
is believed to exist ; and the one-particle wave function being an extended 
wave function (corresponding to the large positive value of T,), the residual 
change in them is unimportant. 


This, then, is our model. The main conclusions derived from this ana- 
lysis are the spin dependence of the A — N interaction the zero spin of ,He* 
and the consequent odd parity of the Kaon ; and finally the comparative 
weakness of the A — N interaction and the consequent slowly varying extend- 
ed wave function of the A~- hyperon. This last point is particularly relevant 
in connection with the omission of pion rescattering corrections in correlating 
the decay dynamics. 
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1. INTRODUCTION 


THE well-known relativistic wave equation for an electron! was derived by 
Dirac by factorising a second order equation into two linear equations. 
It is the object of the present note to point out that the Dirac equation is 
simply the eigenvalue equation of the magnitude of the momentum four- 
vector, and that one can derive it by expressing the magnitude of the momentum 
vector in terms of its four components. 


The above idea enables one to generalise the Dirac equation and obtain 
a relativistic equation for systems containing several electrons. In Section 3, 
we have given the wave equation (Equation 16) for a system composed of 
several particles and this is very similar in form to the Dirac equation. 
Relativistic wave equations for a system of two electrons have previously 
been given by Eddington,* Gaunt* and Breit® of which the one given by Breit 
is the most satisfactory. By replacing the velocities v' and v" of the electrons 
by the spin matrices — ca’ and — ca" in a Hamiltonian given by Darwin,‘ 
Breit was able to obtain an approximate wave equation for two-electron 
systems. It is shown in Section 4, that Equation (16) leads to the Breit 
equation when it is represented in the product space of the two electrons. 


2. THE MOMENTUM FouR-VECTOR 


Before proceeding further, we first state a result which was first proved 
by Weyl® for a vector in a n-dimensional space and which we shall apply 
presently. 


(a) Lemma.—Let (x1, +++; Xn) be the co-ordinates of a vector 
in an Eucledian space with reference to a system of orthogonal axes, and 
let r = (x,2 + x.2 + .-- + Xn”)? denote the ‘magnitude’ or ‘length’ of 
the vector ,. Then 
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r= z (1) 


where the y,’s are elements of an abstract algebra satisfying the relations 


The y’s can be expressed as matrices and for the case n = 4, they are 
the Dirac matrices. 


(b) The Dirac Equation.—Let x,, x2, X3 and x, (= ict) be the co-ordinates 
of a world point, and similarly let p,, p2, p3 and p, (= iE/c) denote the com- 
ponents of the momentum four-vector. Besides x, and p,, we shall also 
use the symbols x, and py, given by x, = ix, and p, = ip,. Expressed as 
operators we have then py = — ifd/dxq and py = ihd/dxp. 


Now the momentum four-vector P is a vector with constant magnitude 
imac where mg is the rest mass of the particle. Apply now the result (1) 


> 
to the vector P = (p,, ps, ps, ps). We then get 
ime. (2) 


If | 4) is an eigenstate of the momentum four-vector, we get from (2) 
the equation of an electron as 


(ivapo + vipi — |) = 0. 


For the y’s we now choose the following representations :— 
0 ids 
= ios 0 


where I, c,, 7, and o, denote respectively the unit matrix in two dimensions 
and the three Pauli matrices. 


Multiplying (3) to the left by — iy,*, we get 
(2. + z api + rec ) =0 (4) 
which is the Dirac equation in its conventional form. 


When the electron is moving in a field, the components of the momentum 
four-vector are given by (pi + e/c Ai) (i = 0, 1, 2, 3) where Ag, Aj, Ag, Ay 
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are the scalar and vector potentials of the field. From the relations z (pi + 


elc Aj)® = — m,2c*, we see that the magnitude of the momentum four- 
vector is equal to imoc in this case also. Thus by replacing p; by (pj + e/c Aj) 
in (4), we get the equation for an electron moving in field as 


The left-hand side of (2) is similar in form to the expression of the length 
of a vector in terms of its direction cosines. The y’s can thus be regarded 
as the representations in a matrix algebra of the direction cosines of the 
momentum four-vector. We have thus 


Pi 
> 
|P | 


= Ui, (6) 


where u; (i = 1, 2, 3,4) are the components of the velocity four-vector. The 
matrices iy; thus represent the components of the velocity four-vector. 


Since 


a, = — i (yy)? (i = 1, 2, 3), 
we have 


_ _ 


= 


Pa 


= — 


Similarly 


p= (8) 


We thus get the well-known expressions for the components of the velocity 
of the particle without calculating the commutation relations of x,, x. and 
X, with the Hamiltonian. 


In finding out the dynamical variables that are the classical analogues 
of products (or quotients) of matrices, care should be taken to verify that 
only the algebraical rules that are common to both the matrix and the ordi- 
nary algebras are used. We have derived (4) by multiplying (2) by — iy? 
making use of the relation (y,)~’ y, = 1 which is common to both algebras, 
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though one could equally well derive (4) by multiplying (2) by — iy,. The 
classical analogue of a; is thus — i(y4)~'yq and not — iy, 7%. 


3. SYSTEM OF MANY ELECTRONS 


We have seen that the Dirac equation can be derived by expressing the 


magnitude of the four-vector P + e/c A in terms of its components. Now 


(Pi Ai) = meu 


where w;, Mo, Mg, Uy are the components of the velocity four-vector. 
natural way to generalise the Dirac equation for a system of particles would 
be to consider a four-vector whose components are respectively Y mcuj 
(i = 1, 2, 3, 4, or 0) where the sum is to be taken over all the particles of 
the sysiem. We shall denote the components of this vector by (Pp + e/c Ao), 
(P, + e/c Ay), + Ag) and (P + e/c Ag). 


Now in the special theory of reiativity the components of the total 
momentum are given by’ 


f -+ 2 myers, (10) 


where Tjz.. are the components of the energy-momentum tensor. In three- 
dimensional form, we can write for the total momentum of field p/us charges 


(10 a) 


and for the energy 
+ Ze, (10 5) 


where 
c 
S= EXxH 
is the Poynting vector and 


W = (E* + HY) 


| 
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is the density of the field energy. For a system of charges, the electrostatic 
energy (U), apart from the self-energy terms, is equal to 


U= f wav = 
rap 
We thus see that the quantities 
— (Aoy Ais Aas As) 
stand respectively for the energy-momentum of the field. 


Let us denote the magnitude of P+ elc A by iP’. P’ is equal to 
mpc only as a first approximation. We have in fact 
4 


— Pp? =—c (z +3(3 mixx) (11) 


k=1 \i=1 


4,4 


i<j 


Pp’? = J (1 + 22 mymjc* (1 


(1 
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where 
Vij = (Vi — Yj). 


Substituting (13) in (12), we get 
2 vi” 
P Z + 22 (1+ 
= (2 mac)? + mymojri;?. 
Thus a second approximation for P’ is given by 


MoilNg 


Pp’ Myc + 4 (15) 


Applying now the results of Section 1 to the vector P + e/c A, we get the 
equation for the system of electrons as 


k=1 


4. REPRESENTATION IN THE PRODUCT SPACE AND 
THE BREIT EQUATION 


Equation (16) describes the system as a whole and can be considered 
to be the equation of motion for the centre of inertia of the system. Since 
we are operating in a four-dimensional space, the matrices a, and § are all 
four-dimensional. They refer to the entire-system of electrons and do not 
contain any labels of the individual particles. In practice, however, one 
needs equations that bring in explicitly the positions and spins of the individual 
electrons of the system. We shall see presently that Equation (16) can be 
transformed into one that contains explicit reference to the spins of the 
electrons if it is represented in the product space of the electrons. 


Since the y’s represent the direction cosines of the momentum four- 
vector, their classical analogues are given by 


MiXik 


Tl 
(14) 
ar 
T 
n= (k = 1, 2, 3) 
and 
(2 imic) 17 
a 
% iP’ (17 a) 


The Dirac Equation for Many-Electron Systems 


Thus we have 


mx 
an = — i ve = — 


Thus we have 


+ Ak) + AP’ 


ke1 


c m; 


i=1 iz1 


correct to terms of the order of v;4/c’. 


Alternatively, one can get (18) without using the idea of direction 
cosines at all. From (16) we have 


kel 


and by definition, 


— (P,+£A,) = mic; 
this leads now to the right-hand of (18). 


For an electron moving in a field, we have 


mv =P 


and 
d 
€ 
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where P is the momentum conjugate to the position of the particle and A 
is the vector potential of the field. Thus we have 


mv? = (Pi At). (19) 


The term e/c 2 v;- A’ gives the interaction energy of the charges with 
the magnetic field produced by the motion of the electrons. 


Now the potentials arising from the motion of a charge e have been 
worked out in Landau and Lifshitz’? and these are given by 


where r is the distance of the charge from the field point and m = r/r. When 
there are several charges we must sum over all the charges. Thus the potentials 
acting at the position of charge 1 due to the motions of charges 2, 3, ---,n 
are given by 


Vv; Vs ° Nya) 


j=2 


Let us suppose that our system consists of n electrons moving in a static 
electric field V (r) (the field of the nuclei). We can write the potential energy 
of the system as 


1 
% = — 5 


rij . 
The factor 4 in the above expression for ¢; is introduced to take care that 
the interaction energy between two electrons is not counted twice. The 


energy is thus expressible as the sum of n different terms, each term standing 
for the energy density of a particle. Consider now ¢, We have 


tc 
a 
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j=2 
and 
where 
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4=V(n)—5 ry? 
j 


this can be interpreted as the potential at the field point of electron 1 due 
to the static field V (r) and the fields of (n — 1) moving charges, each having 
a charge of — ¢/2. Thus the vector potential A’ that arises as a consequence 
of the motion of electrons 2, 3, ..., can be obtained by multiplying (20 5) 
by —4. We have 


= V(r ry? 
4cr,; 
j 
In general we have 
4cri; 


Substituting (21) in (19) we get 


= vi- Pi 


rij 


Substituting (22) in (18) we get 


Dy (Pe Ax) + 
2 


Vi (ve rag) (Vi Fig) 


e2 
+ 
We have seen that the velocity of the electron is related to the spin matrices 
by means of the relations 


v=— 
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Let us therefore introduce a set of matrices a‘ and f* by means of the 
relations 


(24) 


A representation for the matrices a‘ and f in the product space of the elec- 
trons can be obtained by adopting the following rule®: If two physical systems 
a and 5 are compounded to form a total system c, then the system space 
H of ¢c is RXG where R is the system space of a and G of b. In the system 
c obtained by composition, a Hermitean form AxI, is associated with a 
quantity a of a and I, xB with 8 of b where A and B are the forms associated 
with a, B in R, G respectively, and I, and I, are the unit forms in R and G. 


In the product space, therefore, the matrices a‘ and B are given by 


a’ = IxIxl... xI 


and 


pi (25) 


In the above, the product contains m terms, and aand § occur in the i-th 
place; further the x denotes direct product multiplication. Substituting 
(25), (24) and (23) in (16), we get the equation for the system as 


For the case n = 2, the above equation reduces to the well-known Breit 
equation, 


fo 
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SUMMARY 


It is shown that the Dirac equation for an electron moving in a field 
can be derived by expressing the magnitude of the momentum four-vector 
in terms of its components. By considering a four-vector whose components 
denote the total momentum and energy of the particles, a relativistic equation 
for a system of several electrons has been derived. A representation of 
this equation has been made in the product space of the electrons and it is 
shown that for the special case of a system containing two electrons, it leads 
to the well-known Breit equation. 
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